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What is claimed is: 

1 . A method for testing output quality from a data extraction process, comprising: 

receiving input data containing information to be inserted into a database; 

dividing the input data into a plurality of batches such that a subset of the input data is 
5 duplicated among the plurality of batches; 

distributing the plurality of batches to a plurality of data entry clerks, wherein each data 
entry clerk processes one of the plurality of batches and converts data from the batch 
into worked data; 

O receiving the worked data from each of the plurality of data entry clerks; and 

. 5*1: 

jj) inspecting the subset of the worked data corresponding to the duplicated subset of the 
JU input data to determine the accuracy of the subset of worked data. 

fp 2. The method for testing output quality from claim 1 , wherein the step of inspecting 

: , predicts the quality of the worked data. 

H 3. The method for testing output quality from claim 1 , wherein the subset of the 

H input data duplicated among the batches is based on a sampling plan. 

4. The method for testing output quality from claim 1, further comprising repeating 
the steps of dividing, distributing, receiving and inspecting, if a desired level of accuracy 
is not reached. 

5. The method for testing output quality from claim 1 , further comprising adjusting 
20 the desired level of accuracy based on inspecting the subset of the worked data. 

6. The method for testing output quality from claim 1 , wherein the step of inspecting 
the subset of the worked data comprises: 

identifying the subset of the worked data resulting from the duplicated subset of the 
input data; 
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comparing entries made by each of the plurality of data clerks on the subset of the 
worked data; and 

flagging the entries that differ. 

7. The method for testing output quality from claim 1 , wherein the step of inspecting 
5 the subset of the worked data comprises: accepting the worked data for submission to a 

database if the desired level of accuracy is met and rejecting the worked data for 
submission to the database if the desired level of accuracy is not met. 

8. The method for testing output quality from claim 1 , wherein the input data is a 
plurality of technical product data sheets. 

f| 9. The method for testing output quality from claim 1 , wherein the steps of dividing, 
A distributing, receiving and inspecting are accomplished with a computer system. 

-jib; 1 

*2 10. A data extraction tool implemented on a computer, the tool comprising: 

CP 

HI a first receiver unit for receiving input data containing information to be inserted into a 

L database; 

ft a data divider unit for dividing the input data into a plurality of batches such that a 

5 subset of the input data is duplicated among the plurality of batches; 

a distributor unit for distributing the plurality of batches to a plurality of data entry clerks, 
wherein each data entry clerk processes one of the plurality of batches and converts 
data from the batch into worked data; 

20 a second receiver unit for receiving the worked data from each of the plurality of data 
entry clerks; and 

an inspector unit for inspecting the subset of the worked data corresponding to the 
duplicated subset of the input data to determine the accuracy of the subset of worked 
data. 
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1 1 . The data extraction tool implemented on a computer from claim 1 0, wherein the 
inspector unit predicts the quality of the worked data. 

12. The data extraction tool implemented on a computer from claim 10, wherein the 
subset of the input data duplicated among the batches is based on a sampling plan. 

5 13. The data extraction tool implemented on a computer from claim 10, further 
comprising reworking the batch using the distributor unit, second receiver unit, and 
inspector unit, if a desired level of accuracy is not reached. 

14. The data extraction tool implemented on a computer from claim 10, further 
comprising adjusting the desired level of accuracy based on the inspector unit 
10 inspecting the subset of the worked data. 

S 1 5. The data extraction tool implemented on a computer from claim 1 0, wherein the 
j| inspecting of the subset of the worked data performed by the inspector unit comprises: 

ft identifying the subset of the worked data resulting from the duplicated subset of the 

input data; 

B comparing entries made by each of the plurality of data clerks on the subset of the 
In worked data; and 

H flagging the entries that differ. 

16. The data extraction tool implemented on a computer from claim 10, wherein the 
inspecting of the subset of the worked data performed by the inspector unit comprises: 

20 accepting the worked data for submission to a database if the desired level of accuracy 
is met and rejecting the worked data for submission to the database if the desired level 
of accuracy is not met. 

17. The data extraction tool implemented on a computer from claim 10, wherein the 
input data is a plurality of technical product data sheets. 

25 
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1 8. A computer program for a data extraction tool, the computer program embodied 
on a computer readable medium for execution by a computer, the computer program 
comprising: 

a code segment that receives input data containing information to be inserted into a 
5 database; 

a code segment that divides the input data into a plurality of batches such that a subset 
of the input data is duplicated among the plurality of batches; 

a code segment that distributes the plurality of batches to a plurality of data entry clerks, 
wherein each data entry clerk processes one of the plurality of batches and converts 
10 data from the batch into worked data; 

S a code segment that receives the worked data from each of the plurality of data entry 

€z clerks; and 

hi 

S a code segment that inspects the subset of the worked data corresponding to the 

LJ1 duplicated subset of the input data to determine the accuracy of the subset of worked 

t§ data. 

U 19. The computer program for a data extraction tool from claim 1 8, wherein the code 

0 segment that inspects the data predicts the quality of the worked data. 

20. The computer program for a data extraction tool from claim 1 8, wherein the 
subset of the input data duplicated among the batches is based on a sampling plan. 

20 21 . The computer program for a data extraction tool from claim 1 8, further 
comprising reworking the batch using the code segment that distributes, the code 
segment that receives, and the code segment that inspects, if a desired level of 
accuracy is not reached. 

22. The computer program for a data extraction tool from claim 1 8, further 
25 comprising adjusting the desired level of accuracy based the code segment that 
inspects inspecting the subset of the worked data. 
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23. The computer program for a data extraction tool from claim 1 8, wherein the step 
of inspecting performed by the code segment that inspects comprises: 

identifying the subset of the worked data resulting from the duplicated subset of the 
input data; 

comparing entries made by each of the plurality of data clerks on the subset of the 
worked data; and 

flagging the entries that differ. 

24. The computer program for a data extraction tool from claim 1 8, wherein the step 
of inspecting performed by the code segment that inspects comprises: accepting the 
worked data for submission to a database if the desired level of accuracy is met and 
rejecting the worked data for submission to the database if the desired level of accuracy 
is not met. 

25. The computer program for a data extraction tool from claim 1 8, wherein the input 
data is a plurality of technical product data sheets. 
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